Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

MEDIA: a semantically annotated corpus of task oriented dialogs in French

Identifieur interne : 003930 ( Main/Exploration ); précédent : 003929; suivant : 003931

MEDIA: a semantically annotated corpus of task oriented dialogs in French

Auteurs : Hélène Bonneau-Maynard [France] ; Matthieu Quignard [France] ; Alexandre Denis [France]

Source :

RBID : ISTEX:62FE38FFD0D92679441FEF1AFB36859AC3BC3A98

Descripteurs français

English descriptors

Abstract

Abstract: The aim of the French Media project was to define a protocol for the evaluation of speech understanding modules for dialog systems. Accordingly, a corpus of 1,257 real spoken dialogs related to hotel reservation and tourist information was recorded, transcribed and semantically annotated, and a semantic attribute-value representation was defined in which each conceptual relationship was represented by the names of the attributes. Two semantic annotation levels are distinguished in this approach. At the first level, each utterance is considered separately and the annotation represents the meaning of the statement without taking into account the dialog context. The second level of annotation then corresponds to the interpretation of the meaning of the statement by taking into account the dialog context; in this way a semantic representation of the dialog context is defined. This paper discusses the data collection, the detailed definition of both annotation levels, and the annotation scheme. Then the paper comments on both evaluation campaigns which were carried out during the project and discusses some results.

Url:
DOI: 10.1007/s10579-009-9103-2


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">MEDIA: a semantically annotated corpus of task oriented dialogs in French</title>
<author>
<name sortKey="Bonneau Maynard, Helene" sort="Bonneau Maynard, Helene" uniqKey="Bonneau Maynard H" first="Hélène" last="Bonneau-Maynard">Hélène Bonneau-Maynard</name>
</author>
<author>
<name sortKey="Quignard, Matthieu" sort="Quignard, Matthieu" uniqKey="Quignard M" first="Matthieu" last="Quignard">Matthieu Quignard</name>
</author>
<author>
<name sortKey="Denis, Alexandre" sort="Denis, Alexandre" uniqKey="Denis A" first="Alexandre" last="Denis">Alexandre Denis</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:62FE38FFD0D92679441FEF1AFB36859AC3BC3A98</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1007/s10579-009-9103-2</idno>
<idno type="url">https://api.istex.fr/ark:/67375/VQC-VRS28KX7-6/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001701</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">001701</idno>
<idno type="wicri:Area/Istex/Curation">001682</idno>
<idno type="wicri:Area/Istex/Checkpoint">000A37</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000A37</idno>
<idno type="wicri:doubleKey">1574-020X:2009:Bonneau Maynard H:media:a:semantically</idno>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:inria-00424619</idno>
<idno type="url">https://hal.inria.fr/inria-00424619</idno>
<idno type="wicri:Area/Hal/Corpus">003005</idno>
<idno type="wicri:Area/Hal/Curation">003005</idno>
<idno type="wicri:Area/Hal/Checkpoint">002F30</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">002F30</idno>
<idno type="wicri:doubleKey">1574-020X:2009:Bonneau Maynard H:media:a:semantically</idno>
<idno type="wicri:Area/Main/Merge">003A08</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Francis:10-0023530</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000255</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000772</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000238</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000238</idno>
<idno type="wicri:doubleKey">1574-020X:2009:Bonneau Maynard H:media:a:semantically</idno>
<idno type="wicri:Area/Main/Merge">003C72</idno>
<idno type="wicri:Area/Main/Curation">003930</idno>
<idno type="wicri:Area/Main/Exploration">003930</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">MEDIA: a semantically annotated corpus of task oriented dialogs in French</title>
<author>
<name sortKey="Bonneau Maynard, Helene" sort="Bonneau Maynard, Helene" uniqKey="Bonneau Maynard H" first="Hélène" last="Bonneau-Maynard">Hélène Bonneau-Maynard</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>LIMSI–CNRS, Université Paris-Sud 11, Bât. 508, BP 133, 91403, Orsay Cedex</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Île-de-France</region>
<settlement type="city">Orsay</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Quignard, Matthieu" sort="Quignard, Matthieu" uniqKey="Quignard M" first="Matthieu" last="Quignard">Matthieu Quignard</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>LORIA, Campus Scientifique, BP 239, 54506, Vandoeuvre-lès-Nancy Cedex</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Denis, Alexandre" sort="Denis, Alexandre" uniqKey="Denis A" first="Alexandre" last="Denis">Alexandre Denis</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>LORIA, Campus Scientifique, BP 239, 54506, Vandoeuvre-lès-Nancy Cedex</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Language Resources and Evaluation</title>
<title level="j" type="abbrev">Lang Resources & Evaluation</title>
<idno type="ISSN">1574-020X</idno>
<idno type="eISSN">1574-0218</idno>
<imprint>
<publisher>Springer Netherlands</publisher>
<pubPlace>Dordrecht</pubPlace>
<date type="published" when="2009-12-01">2009-12-01</date>
<biblScope unit="volume">43</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="329">329</biblScope>
<biblScope unit="page" to="354">354</biblScope>
</imprint>
<idno type="ISSN">1574-020X</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">1574-020X</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Annotation</term>
<term>Assessment</term>
<term>Computational linguistics</term>
<term>Corpus</term>
<term>Corpus annotation</term>
<term>Dialog system</term>
<term>Evaluation</term>
<term>French</term>
<term>Speech processing</term>
<term>Speech understanding</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Annotation de corpus</term>
<term>Evaluation</term>
<term>Français</term>
<term>Linguistique informatique</term>
<term>Traitement automatique de la parole</term>
</keywords>
<keywords scheme="mix" xml:lang="en">
<term>Annotation</term>
<term>Corpus</term>
<term>Dialog system</term>
<term>Evaluation</term>
<term>Speech understanding</term>
</keywords>
</textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: The aim of the French Media project was to define a protocol for the evaluation of speech understanding modules for dialog systems. Accordingly, a corpus of 1,257 real spoken dialogs related to hotel reservation and tourist information was recorded, transcribed and semantically annotated, and a semantic attribute-value representation was defined in which each conceptual relationship was represented by the names of the attributes. Two semantic annotation levels are distinguished in this approach. At the first level, each utterance is considered separately and the annotation represents the meaning of the statement without taking into account the dialog context. The second level of annotation then corresponds to the interpretation of the meaning of the statement by taking into account the dialog context; in this way a semantic representation of the dialog context is defined. This paper discusses the data collection, the detailed definition of both annotation levels, and the annotation scheme. Then the paper comments on both evaluation campaigns which were carried out during the project and discusses some results.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
<li>Île-de-France</li>
</region>
<settlement>
<li>Orsay</li>
<li>Vandœuvre-lès-Nancy</li>
</settlement>
</list>
<tree>
<country name="France">
<region name="Île-de-France">
<name sortKey="Bonneau Maynard, Helene" sort="Bonneau Maynard, Helene" uniqKey="Bonneau Maynard H" first="Hélène" last="Bonneau-Maynard">Hélène Bonneau-Maynard</name>
</region>
<name sortKey="Bonneau Maynard, Helene" sort="Bonneau Maynard, Helene" uniqKey="Bonneau Maynard H" first="Hélène" last="Bonneau-Maynard">Hélène Bonneau-Maynard</name>
<name sortKey="Denis, Alexandre" sort="Denis, Alexandre" uniqKey="Denis A" first="Alexandre" last="Denis">Alexandre Denis</name>
<name sortKey="Denis, Alexandre" sort="Denis, Alexandre" uniqKey="Denis A" first="Alexandre" last="Denis">Alexandre Denis</name>
<name sortKey="Quignard, Matthieu" sort="Quignard, Matthieu" uniqKey="Quignard M" first="Matthieu" last="Quignard">Matthieu Quignard</name>
<name sortKey="Quignard, Matthieu" sort="Quignard, Matthieu" uniqKey="Quignard M" first="Matthieu" last="Quignard">Matthieu Quignard</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 003930 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 003930 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:62FE38FFD0D92679441FEF1AFB36859AC3BC3A98
   |texte=   MEDIA: a semantically annotated corpus of task oriented dialogs in French
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022